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I . INTRODUCTION 

The  forces  occurring  in  a naval  aircraft  ditching  accident  often 
cause  injuries  or  fatalities  to  the  aircraft  occupants.  Impact  of  the 
aircraft  on  the  water  can  produce  serious  injury,  especially  to  the  neck 
and  head,  causing  the  pilot  to  lose  consciousness.  Unfortunately,  fatal 
consequences  may  result  from  such  an  occurrence.  Thus,  it  is  important 
to  be  aware  of  how  the  dynamic  and  physical  variables  involved  in  a 
■«  ditching  accident  relate  to  the  likelihood  of  pilot  injury. 

I’'  * ‘ 

The  U.S.  Navy's  impact  acceleration  research  program  being  conducted 
by  the  Naval  Aerospace  Medical  Research  Laboratory  (NAMRL)  Detachment  is 
engaged  in  experimentation  concerning  dynamic  response  of  the  human  and 
simian  head/neck  system  as  a function  of  motion  and  anthropometric 

: 

parameters.  The  development  of  a model  based  primarily  on  the  informa- 
tion from  the  experimental  data  allows  important  statistical  inferences 
to  be  made  concerning  injury  probability  as  a function  of  dynamic  and 
physical  variables.  Two  previous  technical  reports  [2,  4]  have  discussed 
model  formulation  and  accuracy,  respectively. 

This  report  addresses  itself  to  the  extraction  of  important  infer- 
ences using  a logistic  function  to  model  injury  likelihood.  As  an 
example,  inferences  are  made  for  a logistic  model  constructed  from  the 
data  obtained  in  the  Monte  Carlo  simulation  study  described  in  a recent 
technical  report  [4].  Specific  information  on  how  the  data  was  simulated 
can  be  obtained  in  that  report.  Since  the  data  was  simulated,  the  true 
underlying  model  parameters  are  known.  Therefore,  this  data  base  allows 
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an  assessment  of  the  accuracy  of  the  inference  procedures  derived.  The 
inference  of  foremost  importance  in  this  study  is  the  estimation  of 
threshold  levels  which  allow  for  only  a small  prespecified  chance  of 
head/neck  injury. 
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II.  BACKGROUND 


The  statistical  impact  acceleration  injury  models,  described  in 
the  previous  technical  reports  [2,  3,  4],  may  be  used  to  predict  the 
probability  of  head/neck  injury  for  a given  set  of  anthropometric  and 
dynamic  response  variables.  Due  to  sampling  variability  in  model 
parameter  estimates,  the  prediction  of  the  probability  of  an  injury  for 
a given  set  of  conditions  has  certain  degree  of  error  variability 
associated  with  it.  Inference  procedures  that  account  for  this  variability 
are  used  to  estimate  "safe"  threshold  levels,  i.e.,  threshold  levels  that 
are  reliable  in  keeping  injury  probability  to  an  acceptable  minimum. 

The  logistic  injury  prediction  model  is  of  the  form 

P(x)  - {l  + exp [- (x'_3) ] } 1 

where  x = (1,  x^,  x^, . . . , x^)  denotes  the  set  of  independent 

variables  considered 

and  = (3q,  3^,  S^*-***  8^)  denotes  the  set  of  parameter  values. 

~ * 

If  3.  denotes  the  maximum  likelihood  estimate  of  3.,  then  P(x)  * 

A 

il  + expI-Cx'ji)]}-1  is  an  estimate  of  P(x).  To  derive  the  inference 
procedures  concerning  and  P(x)  that  give  consideration  to  the  variability 

A A A 

of  the  sampling  distribution  of  £ must  be  known.  Because  ^ is  a function 
of  the  data,  it  is  the  outcome  of  a random  vector. 

A 

Choosing  to  be  the  maximum  likelihood  estimate  of  allows  exploita- 
tion of  the  rich  theory  of  maximum  likelihood  estimation  to  obtain  an 

A 

approximation  to  the  distribution  of  j}.  It  can  be  shown,  using  this 
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theory,  that  J3  is  asymptotically  distributed  as  multivariate  normal  with 
mean  vector  3>  and  a certain  variance-covariance  matrix,  £.  The  inherent 
variability  associated  with  inferences  concerning  j?  and  P(x)  are  directly 
related  to  the  variance-covariance  structure  found  in  the  £ matrix.  This 

A 

matrix,  unfortunately,  is  an  unknown  parameter  of  the  distribution  of  j5. 
However,  by  applying  maximum  likelihood  theory  to  the  logistic  model,  it 
is  possible  to  show  that  Z is  a function  of  3..  Thus,  Z_  can  be  estimated 

a 

by  the  substitution  of  for 

A A 

The  FORTRAN  subroutine  of  Jones  [1]  was  used  to  compute  and  Z (the 

A A 

estimate  of  I)  from  the  simulated  data.  The  two  statistics  £ and  £ are 
sufficient  to  form  any  (approximate)  inferences  about  £ and  P(>c)  that  may 
be  desired.  It  should  be  noted  that  as  the  amount  of  data  increases,  the 

A A 

estimates,  8 and  Z,  get  closer  to  the  true  values  of  the  parameters  J3  and 

A 

Z_t  and  the  sampling  distribution  of  J3  becomes  closer  to  multivariate 
normal . 

Estimates  of  and  £ for  models  A and  B (defined  in  a previous  report 
[4])  are  given  in  Figures  1 through  4.  Sample  sizes  of  N = 100  and  N = 1000 
were  used  for  both  models. 
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III.  INTERVAL  ESTIMATION  AND  HYPOTHESIS  TESTING 


Point  estimation  of  j?  and  P(x)  was  discussed  in  previous  technical 
reports  [2,  4].  This  report  is  concerned  with  inferences  about  and 

A 

P(x)  that  account  for  sampling  variability  in  _B.  Point  estimates  do  not 
do  this.  Confidence  interval  estimates  and  tests  of  hypotheses  concern- 

A 

ing  _3  and  P(x)  are  sensitive  to  the  variability  of  so  attention  is 
directed  to  them.  Such  statistical  structures  are  important  in  the 
development  of  safe  threshold  levels. 

A.  INTERVAL  ESTIMATION 

In  this  section  confidence  intervals  are  derived  for  3^  and  for 

A 

P(x),  for  a given  set  of  independent  variables  x>  The  fact  that  £ is 
asymptotically  multivariate  normal  with  mean  vector  and  covariance 
matrix  Z enables  the  derivation  of  an  approximate  1 - a confidence 
interval  for  x'j?  and  for  P(x)  corresponding  to  a given  x vector.  Since, 
asymptotically , 

6 ~ N(8,  £) 

it  follows  that,  asymptotically, 
x'l  ~ N(x'j3,  q2(x)) 

where  q(x)  = (x1 Zx)  . This  in  turn  implies  that,  approximately, 

Prob(x'g  - za^2q(x)  < x’j?  £ x'£  + za^2q(x))  = 1 - a. 
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where  is  the  upper  a point  of  a standard  normal  distribution.  Thus, 
the  outcome  of  this  random  interval  is  an  approximate  1 - a confidence 
interval  for  x’J3. 

The  event 


(x'B  - za^2q(x)  < x’8  < x’B  + za/2ci (£>} 


is  equivalent  ro  the  event 


E = {(1  + exp [- (x'jS  - za^2q(x))])  < P(x)  < (1  + exp[-(x'jS  + 

za/2q«)])"1}. 

Therefore,  the  probability  of  E is  also  1 - a,  approximately.  Thus, 
the  random  interval 

1(1  + exp  [-  (x'j3  - z^^qOO)])  1,  (1  + exp[-(x'jS.  + z^2q(x))])  1] 

(1) 

contains  P(x)  with  probability  approximately  1 - a.  The  outcome  of  this 
random  interval  is  then  an  approximate  1 - a confidence  interval  for 


It  is  also  possible  to  construct  confidence  intervals  for  a given 

A A 

parameter  8^  by  using  and  Z.  Since  8^  is  a maximum  likelihood  estimate 

A 2 
for  8i>  8i  is  asymptotically  normal  with  mean  8i  and  variance  v^,  where 

2 th  ^ 

v^  is  the  (i,  i)  element  of  j^.  This  fact  implies  that 

A /\ 

ProbCB^  - z Vj  < £ Bj^  + za/2vi^  = 1 ~ a»  approximately. 


Thus,  the  random  interval 


(6i  “ Za/2vi*  Si  + za/2vi) 


contains  3^  with  probability  approximately  1 - a.  Therefore,  the  outcome 
of  this  random  interval  is  an  approximate  1 - a confidence  interval  for  3^. 

B.  HYPOTHESIS  TESTING 

A A A 

By  using  3.,  Z and  the  asymptotic  normality  of  hypothesis  tests 
for  the  parameters  P(x)  and  3^  (i  = 0,1,..., k)  may  be  easily  constructed. 

As  previously  stated,  asymptotically, 

x'0.  - N(x'3,  q2(x)) . 

A 

It  follows  that  (x'S.  ~ c)/q(x)  can  be  used  as  an  asymptotic  test  statistic 
for  testing  Hq:  x'j3  c for  instance.  However,  H^:  x'3  > c is  equivalent 
to  Hq:  P(,x)  _>  p0,  where  c = ln(pQ/ (1  - PQ)).  Thus 

[x'l  - ln(pQ/(l  - pQ) ) ] /q (x)  (3) 


is  the  asymptotic  test  statistic  for  testing  hypotheses  relating  P(x) 
to  some  hypothesized  value,  p^.  Because  of  the  asymptotic  normality  of 

A 

x'jJ,  the  test  statistic  for  P(x)  is  asymptotically  normal  with  mean  zero 
and  variance  one  under  the  null  hypothesis,  i.e.,  when  P(x)  = p^. 

The  statistical  structure  of  hypothesis  testing  can  be  used  to  control 
the  probability  of  making  a wrong  assumption  concerning  the  relationship 
of  P(x)  to  Pq.  Consideration  will  be  given  to  the  following  two  hypothesis 
testing  formats,  using  a significance  level  of  a: 
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it  is  important  to  know  how  hypothesis  testing  can  be  used  most  effectively 
in  drawing  safe  conclusions  about  P(x)  in  relation  to  p^.  First  consider 
the  hypothesis  testing  format  in  (a): 

Hq:  P(x)  £ Pq  (null  hypothesis) 

H : P(x)  > p (alternative  hypothesis) 

1 0 

Rejection  of  when  is  true  is  called  a Type  I error.  The  prob- 
ability of  making  such  an  error  is  a.  Failing  to  reject  when  is 
true  is  a Type  II  error.  The  probability  of  making  this  error  depends 
upon  the  sample  size,  values  in  the  alternative  hypothesis,  and  a.  In 
this  testing  format  a Type  II  error  could  be  a dangerous  error,  causing 
the  use  of  unsafe  combinations  of  the  independent  variables,  i.e.,  unsafe 
acceleration  profiles. 

Now  consider  the  hypothesis  testing  format  in  (b): 

Hq:  P(x)  1 Pq  (null  hypothesis) 

H^:  P(x)  < Pq  (alternative  hypothesis) 

Here  the  Type  I and  Type  II  errors  are  reversed  from  that  in  format  (a) . 

The  Type  I error  is  now  the  dangerous  one.  The  structure  of  hypothesis 
testing  is  such  that  one  simply  specifies  the  probability  of  Type  I error, 
whereas  the  probability  of  Type  II  error  may  be  difficult  to  control. 

Thus,  the  hypothesis  testing  format  of  (b)  is  the  safer  one  to  use  in 
making  inferences  concerning  the  size  of  P(x)  in  relation  to  some  specified 

V 

In  some  cases  it  might  be  desired  to  test  whether  a given  parameter 
8^  is  useful  in  the  model,  i.e.,  is  nonzero.  The  hypothesis  test  for  8^ 
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depends  upon  the  fact  that,  asymptotically, 

Si  ~ N(0.,  v2±  ). 

It  follows  that  the  appropriate  test  statistic  is 

(B.  - b)/Vi  (4) 

for  hypotheses  relating  0^  to  some  value  b.  The  test  statistic  is 
asymptotically  normal  with  mean  zero  and  variance  one  under  the  null 
hypothesis,  i.e.,  when  0^  = b. 

Another  valuable  hypothesis  test  is  based  on  a likelihood  ratio 
statistic  which  may  be  used  for  testing  a reduced  model,  i.e.,  for 
testing  whether  or  not  a set  of  p certain  parameters  (p  k)  are  zero. 
This  test  was  used  in  a recent  study  based  on  -Gx  acceleration  data 
from  subhuman  primates  [3J.  The  framework  for  this  test  involves  the 
following  two  hypotheses: 

Hp.:  0.  =0.  = ...  = 0.  =0  (Reduced  model) 

1 x2  xp 

H^:  B^'s  unspecified  (Full  model) 

The  appropriate  test  statistic  is 

A = Lx  - L2  (5) 

where 

is  -2  times  the  logistic  likelihood  using  the  maximum  likelihood 
estimate  obtained  under  and 
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Lj  “2  times  the  logistic  likelihood  using  the  maximum  likelihood 
estimate  obtained  under  H^. 

The  null  hypothesis  is  rejected  at  a significance  level  a if 
2 

A > X (a.  p)»  the  upper  a point  of  a chi-square  distribution  with  p 
degrees  of  freedom. 
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IV . CRITICAL  ENVELOPES 


An  inference  of  much  importance  is  the  assessment  of  P(x),  the  true 
probability  of  injury  for  a given  set  of  conditions  x,  in  relation  to 
some  small  probability  p^.  Such  inferences  can  be  made  by  the  prediction 
of  critical  envelopes  and  by  testing  hypotheses  concerning  P(x).  A 
critical  envelope  can  be  defined  as  the  set  of  all  combinations  of  indepen- 
dent variables  for  which  the  predicted  probability  of  injury  is  less  than 
some  given  amount.  However,  variability  in  the  predicted  probability 
causes  error  variability  in  the  prediction  of  the  critical  envelope.  To 
predict  safer  critical  envelopes  such  variability  must  be  taken  into 
account. 

A A 

If  P(x)  is  the  predicted  probability  of  injury,  then  P(x)  £ Pq  is 
equivalent  to 


x'ji  £ ln[pQ/(l  - pQ)  ] . 


(6) 


Thus,  choosing  the  combination  of  independent  variables  x such  that  (6) 

A 

is  true  is  equivalent  to  choosing  x such  that  P(x)  £ Pq.  Let 

C(p0)  = {x:  x'_3  = lnfp^/Cl  - Pq)J}. 

Then  C(p  ) is  the  critical  envelope  corresponding  to  p . Such  an 
0 0 
envelope,  however,  does  not  adjust  for  the  variability  of  j3.  This  may 

result  in  the  use  of  acceleration  conditions  that  are  unsafe. 

A more  conservative  prediction  of  a critical  envelope  that  leads  to 

the  use  of  safer  conditions  is 
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ca(po)  = {*:  x'l  + zaq(x)  = ln[pQ/(l  - pQ)]}.  (7) 

Utilization  of  combinations  of  the  independent  variables  such  that  x 
is  in  the  critical  envelope  implies  that 

{l  + exp(-[x'£  + zaq(x)])}  1 = pQ.  (8) 

The  left-hand  side  of  equation  (8)  is  the  end  point  of  an  approximate 
1 - a right-sided  confidence  interval  for  the  true  P(x).  Thus  one  can 
be  about  1 - a confident  that  for  any  x in  C^Cpg), 

P(x)  < {l  + exp[-(x'6  + z q (x) ) ] } 1 = p . 

- - -a  - 0 

By  decreasing  a,  confidence  may  be  increased  by  predicting  a more 
conservative  critical  envelope.  The  additional  safety  achieved  by  such  a 
critical  envelope  depends  on  the  fact  that  an  interval  estimate,  rather 
than  a point  estimate,  is  employed.  The  confidence  interval  used  adjusts 
for  the  sampling  variability  of  the  estimated  parameters  in  the  injury 
model.  It  must  be  noted,  however,  that  the  smaller  a is  made,  the  more 
severe  will  be  the  constraints  on  the  permitted  acceleration  profiles. 

In  the  extreme,  selecting  a extremely  close  to  zero  will  result  in  no 
acceleration  permitted.  It  is  clear,  therefore,  that  important  tradeoffs 
will  have  to  be  made  in  establishing  any  critical  envelopes  to  be  adopted 
in  practice. 


V.  EXAMPLES 


This  section  presents  examples  of  the  inference  results  derived 
in  the  previous  section.  The  data  used  in  those  examples  is  taken 
from  the  simulation  study  [4]  previously  mentioned.  In  each  example 
of  inference,  results  for  Models  A and  B are  given  for  sample  sizes 
of  N = 100  and  N = 1000.  Whenever  possible,  the  examples  list  true 
parameter  values  along  side  the  statistical  results  so  that  the  accuracy 
of  the  inference  procedures  can  be  discerned. 

Figure  5 provides  an  example  of  95%  confidence  intervals  for  the 
probability  P(x)  for  a specific  x vector.  This  vector  was  chosen 
because  it  corresponds  to  low  probabilities  (which  tend  to  be  of  main 
interest)  for  both  Models  A and  B.  Figures  6 through  9 present,  for 
both  models  and  sample  sizes,  scatter  plots  of  the  length  of  various 
confidence  intervals  for  P(x)  versus  the  corresponding  true  value  of 
P(x)  for  varying  x vectors.  It  can  be  seen  from  these  scatter  plots 
that  the  length  of  the  confidence  interval  for  P(x)  tends  to  get 
smaller  as  P(x)  gets  closer  to  one  or  zero.  This  is  fortunate  since 
concern  centers  on  small  probabilities  of  injury. 

Figure  10  lists  95%  confidence  intervals  for  parameter  8^  of 
Models  A and  B.  Figures  11  and  12  exhibit  the  results  of  hypothesis 
tests  of  P(x)  and  6^,  respectively.  The  specific  x vector  used  for 
the  confidence  intervals  in  Figure  5 was  also  used  for  the  hypothesis 
test  in  Figure  11  because  it  corresponds  to  low  probabilities. 


Figure  13,  using  Model  B with  a sample  size  of  100,  illustrates 
the  test  of  a reduced  model  defined  by  setting  the  three  smallest  model 
parameters  to  zero  (i.e.,  6^  = 32  * “ 0) • The  three  parameters  are 


Figure  5:  95%  Confidence  Intervals  for  P(x)  with  jc  = 

(1,  -0.0862,  -0.2621,  0.9114,  -0.7660,  -0.1363,  -0.4655) 


Conf idence 


Figure  6:  Confidence  Interval  Length  Versus  True  P(x)  Value 
for  Model  A and  Sample  Size  of  100 
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Figure  7:  Confidence  Interval  Length  Versus  True  P(x)  Value 
for  Model  B and  Sample  Size  of  100 
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Figure  8:  Confidence  Interval  Length  Versus  True  P(x)  Value 
for  Model  A and  Sample  Size  of  1000 


Figure  9:  Confidence  Interval  Length  Versus  True  P(x)  Value 
for  Model  B and  Sample  Size  of  1000 
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Figure  10:  95%  Confidence  Intervals  for 
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Figure  11:  Hypothesis  Tests  of  HQ:  P(x)  > 0.05  versus  H : P(x)  < 0.05  (a  = 0.05) 
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shown  to  be  not  statistically  significant  at  the  0.05  significance  level. 

Since  the  actual  critical  envelopes  for  the  simulated  data  are  six- 
dimensional, it  is  impossible  to  exhibit  them  graphically.  However,  some 
information  can  be  provided  by  the  two-dimensional  plots  shown  in  Figures 
14  and  15.  Since  X5  and  xg  are  the  two  strongest  predictor  variables, 
they  were  allowed  to  vary  to  form  plots,  while  x^,  X2,  x^,  and  x^  were 
held  fixed  at  the  values  used  in  the  previous  inferences  concerning  P(x). 
The  two-dimensional  projection  of  the  critical  envelope  (a  straight  line) 
derived  in  [3]  is  plotted  on  the  same  graph  as  the  corresponding  two- 
dimensional  safer  critical  envelope  (a  curved  line)  derived  in  this 
technical  report.  Figure  14  illustrates  the  two-dimensional  critical 
envelope  plots  corresponding  to  Model  B with  a = 0.05,  Pq  = 0.10,  and  a 
sample  size  of  100  observations.  Figure  15  illustrates  similar  plots 
except  that  the  sample  size  was  increased  to  1000  observations.  Note 
that  as  the  sample  size  increases  from  100  to  1000  observations,  the  two 
different  types  of  critical  envelope  become  closer.  The  main  factors 
influencing  the  shape  of  the  critical  envelopes  derived  in  this  report  are: 

(1)  the  confidence  level  of  the  corresponding  one-sided  confidence 
interval,  a, 

(2)  the  critical  envelope  probability,  Pq, 

(3)  the  statistical  significance  of  the  two  plotted  predictor 
variables, 

(4)  the  structure  of  the  variance-covariance  matrix  of  j5, 
and  (5)  the  sample  size  used  to  estimate  j3. 
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VI.  DISCUSSION 


The  statistical  inference  procedures  discussed  in  this  technical 
report  account  for  variability  in  the  data.  Because  these  procedures 
are  asymptotic,  they  are  more  accurate  for  large  samples  than  for  small 
samples.  Nonetheless,  the  use  of  these  inference  procedures  in  computing 
injury  threshold  levels  is  far  safer  than  the  use  of  point  estimates, 
which  do  not  account  for  sampling  error. 

The  inferences  described  in  this  report  were  formulated  through 
the  use  of  maximum  likelihood  theory.  Such  statistical  inferences 
utilize  only  sample  information.  However,  it  may  be  desirable  to  combine 
a researcher's  prior  knowledge  about  an  experiment  with  the  experimental 
sample  information  by  means  of  Bayesian  analysis.  The  assessment  of  a 
prior  multivariate  probability  distribution  for  vector  & could  be  very 
difficult.  However,  for  a given  x vector  of  concern,  an  experimenter's 
prior  knowledge  of  POO  should  be  relatively  easy  to  quantify.  Once  a 
prior  distribution  of  values  for  P(x)  is  obtained,  the  prior  information 
can  be  combined  with  the  sample  information  to  produce  inferences  that 
reflect  both  prior  and  sample  information. 
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